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Abstract. We prove the Kato conjecture for degenerate elliptic operators on R". 
More precisely, we consider the divergence form operator C w = — u; divAV, where 
w is a Muckenhoupt Ai weight and A is a complex- valued n x n matrix such that 
w A is bounded and uniformly elliptic. We show that if the heat kernel of the 

associated semigroup e~ tCm satisfies Gaussian bounds, then the weighted Kato 

1 li 

square root estimate, \Lw /||l 2 (>) ~ ||V/||/,2(„,), holds. 



1. Introduction 

The purpose of this work is to give a positive answer to the Kato square root 
problem for a class of degenerate elliptic operators, under the assumption that the 
associated heat kernel satisfies classic Gaussian upper bounds. 

Before stating our results, we briefly sketch the background. Given a uniformly el- 
liptic, nxn complex matrix A, define the second-order elliptic operator £ = — divAV. 
Then the square root £}l 2 can be defined using the functional calculus. The origi- 
nal Kato problem was to show that for all / in the Sobolev space H 1 , ||£ 1//2 /|| i 2 « 
||V/||l2. This was first posed by Kato [23] in 1961, but only solved in the past decade 
in a series of remarkable papers by Auscher, et al. [3, 4, 20]. Initially, they solved 
the problem given the additional assumption that the heat kernel of the semigroup 
e~ tCw satisfied Gaussian bounds. Such estimates were known to be true in the case 
A was real symmetric, but it had been shown that they need not hold for complex 
matrices in higher dimensions [2]. The final proof omitted this hypothesis. For a 
more complete history of this problem, we refer the reader to the above papers or to 
the review by Kenig [25]. 

We have extended this result to the case of degenerate elliptic operators, where 
the degeneracy is controlled by a weight in the Muckenhoupt class A 2 . We say that 
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a weight w (i.e., a non-negative, locally integrable function) is in A 2 if 

where the supremum is taken over all cubes Q C M n . Given w G A 2 and con- 
stants A, A, < A < A < oo, let S n (w, X, A) denote the class of n x n matrices 
A = (Aij (x))™ - =1 of complex- valued, measurable functions satisfying the degenerate 
ellipticity condition 



(1.1) 



Xw (x) K| 2 < Re (A£, = ReZl 3=1 Aj (x) 
^ (A^rj)\<Aw(xM\\rj\ 

for all £, 77 G C n . 

Given A G S n (w, X, A), define the degenerate elliptic operator in divergence form 
C w = — w~ 1 divAV. Such operators were first considered by Fabes, Kenig and Ser- 
apioni [15] and have been considered by a number of other authors since. (See, for 
example, [7, 8, 9, 10, 16, 26, 27].) It is a natural question to extend the Kato problem 
to these operators: that is, to show that 

for all / in the weighted Sobolev space Hq(w). (Exact definitions will be given below.) 
We consider this problem in the special case that the heat kernel of the semigroup 
e -tc w sa tisfies Gaussian bounds. More precisely, we assume there exists a heat kernel 
Wt(x,y) associated to the operator e~ tCw such that for all / G 



C ) 



1.2) e~ tc -f{x)= I W t (x,y)f(y)dy. 



Furthermore, for all t > and x, y G R n , the kernel W t satisfies the Gaussian bounds 
(1-3) \W t (x,y)\<^e W (-C 2 la 



and the Holder continuity estimates 

(1.4) \W t (x + h, y) - W t (x, y) \ + \W t (x,y + h) - W t (x, y) \ 

< Ci ( \h\ y 



\x - y\ 
t 

where h G M n is such that 2\h\ < t l l 2 + \x — y\. The constants C±, Ci and fi depend 
only on n, w, A, and A. If these three properties hold we will say that e~ tCw satisfies 
Condition (G). 

Our main result is the following theorem. 



THE KATO PROBLEM 
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Theorem 1.1. Given w £ A 2 and A £ S n (w,X,A), suppose that e~ tCw satisfies 
Condition (G). Then there exists a positive constant C = C (n, A, A, w) such that for 
all f in Hq(w), 

(i-5) liv/iu^c- 1 < \\C l J 2 f\\ LHw) < C\\Vf\\ L * {w) . 

To prove Theorem 1.1 it actually suffices to prove the second inequality, 
(1-6) \\C]i 2 f\W {w) <C\\Vf\\ L , {w) . 

For suppose this inequality holds. Since A £ £ (n, A, A, w) implies A* £ £ (n, A, A, w), 
(1.6) holds for (Cw 2 )* = (C^) 1 ^ 2 . (This operator identity follows from the functional 
calculus, for instance, from (5.2).) Therefore, by the ellipticity conditions 

HV/||| 2H = / \Vf(x)\Mx)dx 

< A _1 Re f AVf(x) ■ V/(x) dx 
=A~ 1 Re / C w f(x)f(x)w(x)dx 
=\- 1 Re [ C 1 J 2 f(x)(C 1 J 2 )*f(x)w(x)dx 

Our proof of inequality (1.6) follows the outline of the proof of the classical Kato 
problem with Gaussian bounds in [20]. (See also the expository treatment in [19].) 
There are four main steps: in Section 5 we reduce (1.6) to a square function inequality; 
in Section 6 we show that this inequality is a consequence of a Carleson measure 
estimate; in Section 7 we prove a weighted Tfe-theorem for square roots; finally, in 
Section 8 we construct the family of test functions need to use the Tb theorem to 
prove the Carleson measure estimate. Prior to the proof itself, in Sections 2 and 3 we 
give some preliminary results about degenerate elliptic operators, Gaussian bounds 
and weighted norm inequalities. And in Section 4 we prove two weighted square 
function inequalities needed in our proof. The first, in particular, is central, since 
it is the replacement for the (much simpler) Fourier transform estimates used in the 
unweighted case. 

Throughout, all notation is standard or will be defined as needed. The letters C, c 
will denote constants whose value may change at each appearance. Given a function 
/ and t > 0, define f t (x) = t~ n f(x/t). Given an operator T on a Banach space X, 
let ||T|| B (x) denote the operator norm of T. 
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2. Degenerate Elliptic Operators 

The properties of the degenerate elliptic operator C w and the associated semigroup 
e -tc w were developed in detail in [11] and we refer the reader there for complete 
details. Here we state the key ideas. 

Given a weight w G A 2 , the space Hq(w) is the weighted Sobolev space that is the 
completion of Cf with respect to the norm 

H/lkV)= ( I {\f(x)\ 2 + \Vf(x)\ 2 )w(x)dx 
Given a matrix A G S n (w, A, A), define a(f, g) to be the sesquilinear form 

(2.1) a(f,g)= f A{x)Vf\x)-VgJx)dx. 

Since w G A 2 and A satisfies (1.1), a is a closed, maximally accretive, continuous 
sesquilinear form. Therefore, there exists a densely defined operator C w on L 2 (w) 
such that for every / in the domain of C w and every g G L 2 (w), 

(2.2) a(f,g) = {£wf, g)w = / C w f(x)g(x)w(x)dx. 

If /, g are in and in the domain of C w , then integration by parts yields 

(C w f,g) w = a(f,g) = (£f,g), 

where ( , ) is the standard complex inner-product on L 2 . Thus, at least formally, 
C w = w~ l C = -iu _1 divAV. 

Further, the properties of the sesquilinear form a guarantee that the semigroup 
e -tCw exists. In the special case when A is real and symmetric, then the heat kernel 
of e~ tCw satisfies Condition (G). 

Finally, in [11] we proved the following results which will be needed in our proof. 

Lemma 2.1. Given a matrix A G S n (w, X, A) , suppose that the heat kernel of the 
associated semigroup e~ tCw satisfies Condition (G). Then for all t > 0, e~ tCw l = I: 
that is, for all x G R n , 

/ W t (x,y)dy = l. 

If e~ tCw satisfies Gaussian bounds, then its derivative satisfies similar bounds. More 
precisely, let V t = —ItC-we' 1 Cw = j- t e~ l Cw . Then the following result holds. 

Lemma 2.2. The operator Vt has a kernel Vt(x,y) with the following properties: 
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For all x, y el" and t > 0, 
(2-3) \V t (x,y)\<^e^(^-cJ-^^j 

For almost every i 6 I", V t l = 0, that is, for all x G R n , 

V t (x,y)dy = 0. 



There exists a = a(n, A, A, w) > such that for almost every x, y, G R n ; 2 \ h\ < 
t + \x\. 



\V t (x,y)-V t (x,y+h)\+\V t (x+h,y)-V t (x,y)\ < £^ ( -f^ ) exp ( -C 2 " 









t + 1 


.r 



F - 2/1 



f 2 



3. Weighted Norm Inequalities 



Central to our proof is the theory of weighted norm inequalities for classical opera- 
tors, particularly singular integrals and square functions. In this section we state the 
results we need; the standard ones are given without proof and we refer the reader 
to Duoandikoetxea [13], Garcfa-Cuerva and Rubio de Francia [17], and Grafakos [18] 
for complete information. 

We begin with the weighted norm inequalities for the Hardy-Littlewood maximal 
operator, for convolution operators, and for singular integrals. 

Lemma 3.1. Let w G Ai. Then M is bounded on L 2 (w) and ||M||b(l 2 (») < 
C(n, [w]a 2 ). Furthermore, suppose <fi and $ are such that for all x, \4>{x)\ < 
and $ is radial, decreasing and integrable. Then the operators <p t * f are uniformly 
bounded on L 2 (w); in fact, 

sup|^*/(z)| <C(n)||$||iM/(x). 
t>o 

Remark 3.2. If is a Schwartz function then such a function <3> always exists. 



Lemma 3.3. Let K : R™\{0} — > R be a locally integrable function such that K G L c 
and 

C 



\VK{x)\ < J^, Xy^O 



\x 



Then the singular integral 

Tf(x)= i K(x-y)f(y)dy 
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and the maximal singular integral 

T*f(x) = sup 



e>0 

2 



K(x-y)f(y) dy 

\x-y\>e 



are bounded on L {w), and ||T|| B (l 2 (»), ||^*||b(l 2 (uj)) < C(n, \w\a 2 ,K). 
Remark 3.4. An important example of singular integrals are the Riesz transforms: 

R j f(x) = c n [ Xj ~l j f(y)dy, l<j<n, 

Jr" f — y\ 



where the constant c n is chosen so that 



Our next two results are square function inequalities. The first is a weighted version 
of Carleson's theorem due to Journe [21]. Define the weighted Carleson measure norm 
of a function 7t by 

1 f f m dt 

(3.1) IM|c> = sup —— / / \~it(x)\ 2 —w(x)dx. 

Q w{Q) J Q J t 

Lemma 3.5. Let w G A 2 and suppose j t is such that ||7t||o < oo. Let p e C^°(lR n ) 
be such that p is a non-negative, radial, decreasing function, supp(p) C i?i(0), and 
\\p\\i = 1. Then for all f G L 2 (w), 

|fe*/)(x)| 2 | 7i (x)| 2 ^)dx^<C|| 7t || c ,J|/||i 2w . 

The second result is a Littlewood-Paley type inequality. 

Lemma 3.6. Given w G A 2 , let ip be a Schwartz function such that ifj(0) = 0. Then 
for all f G L 2 (w), 

r f°° dt 

(3.2) / / \^*fXx)\ 2 -w(x)dx<C(n^,[w} A2 )\\f\\h iw y 
Jr" Jo 1 

Proof. A direct proof of Lemma 3.6 is given by Wilson [30]. Here we sketch a proof 
that is implicitly based on the idea that 



/ r°° dt 



1/2 



can be regarded CIS db vector- valued singular integral. 
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By a standard argument in the theory of weighted norm inequalities, it will suffice 
to prove that for all < 5 < 1 there exists a constant C$ such that for x G M n , 

(3.3) M*(g4f) s )(x)<C 5 Mf(x) 5 , 

where M is the Hardy-Littlewood maximal operator and M* is the sharp maximal 
operator of Fefferman and Stein. 

The proof of inequality (3.3) is readily gotten by adapting the argument in Cruz- 
Uribe and Perez [12, Lemma 1.6] for the g\ operator. The changes are straightfor- 
ward, so here we only indicate the key steps. (Also see Alvarez and Perez [1].) Our 
assumptions on ip guarantee that g$ is bounded on L 2 and is weak (1, 1). (See [17, 
p. 505].) Therefore, we only have to prove that if \x\ > \h\/2, then 

/ r°° dt\ 1/2 I/; I 1 / 2 

(3.4) / \^ t {x + h)-Mx)\ 2 4) <Ci ' 



t J ~ \x\ n +V 2 ' 

This is the vector- valued analog of the gradient condition in [12, Lemma 1.6]. To 
prove (3.4), note that since ip is a Schwartz function it is bounded. Hence, by the 
mean value theorem, for each x and t there exists 9, < 9 < 1, such that 

dt f°° , , fx + h\ ,/£\,n dt 



Mx + h)-Mx)\ 2 j= / I^FrW(f) 



t 2n+l 



< C\h\ J°° \v<p ( 

= C\h\ f +C\h 



t J r V t J 1 t 2n+1 
x + 9h\ , dt 



\ t J l t 2n + 2 



Since \x + 9h\ > \x\/2 and |V^(y)| < C\y\ 



-2n-2 



1^(^)1^ < CWj [ \x\-'"-' dt = C 



2n-2 



xl 2n+1 



-2n 



We estimate the second integral in the same way, using that |Vt/>(x)| < C\x\ 
Taking the square root we get (3.4). □ 



The next proposition is a key estimate in our proof of Theorem 1.1. It yields 
a square function estimate given size and regularity assumptions on the kernel of 
the operator. In the unweighted case, this result can be found in, for example, 
Grafakos [18, p. 643] or Hofmann [19]; in a somewhat different form it can be found 
in Auscher and Tchamitchian [5]. 
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Proposition 3.7. Let w G A 2 and let ip be a radial Schwartz function such that 
?(0) = and 

r°° - dt 

(3.5) j Q m 2 j = i- 

Let Qtf{x) = ipt * f{x)- Given a family of sublinear operators {Rt}, suppose that 
each Rt is bounded on L 2 (w), and for all t, s > the composition RtQ s is bounded 
on L 2 {w) and for some a > 0, 

(3.6) \\ R t Q s \\ B(L 2 M) < K mm . 

Then the family {Rt} satisfies the square function estimate 
r°° r M 

(3.7) / / \R t f(x)\Mx)dx-<K-C(n^,a,[w} A Mf\\li iw y 

JO jR n t 

The proof of Proposition 3.7 requires a weighted version of the Calderon reproduc- 
ing formula given by Wilson [30]. 

Lemma 3.8. For all w G A 2 and f G L 2 {w), 

r°° dt 

I j = /(*), 

where this equality is understood as follows: for each j > 1, let Bj be the ball centered 
at of radius j , and define the function 

r j dt 
m = / Qt{x Bj Qtf){x)-. 

Then for each j , fj G L 2 (w) and {fj} converges to f in L p (w). 

Proof of Proposition 3.7. Fix / G L 2 (w) and let fj be as in Lemma 3.8. Since R t is 
bounded on L 2 (w), we have that for each t > 0, 

/ \R t f(x)\ 2 w(x) dx = lim / \R t fj(x)\ 2 w(x) dx. 

Since each R f is sublinear, we have that 



\Rtfj(x)\ < / \RtQs{xB 3 Q s f){x)\-. 
Ji/i s 



Therefore, by Fatou's lemma, Minkowski's inequality, and (3.6), 
\R t f(x)\ 2 w(x) dx^- 
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f°° f dt 

< liminf / / \R t fj(x)\ 2 w(x) dx — 

j^oo J J Rn t 

< liminf [ ( f iRtQsixB.QsfKx)]-) w(x)dx^ 

< limmf^ 00 ^ ^ \R t Q s ( X B,Qsf)(x)\ 2 w(x)dxy 2 j 

,oc / n / t s y ds\ 2 dt 

< liminf K/ / mm -,- Wxb.QsJ l 2 («0 — — 

Jo \Ji/j \s tj s J t 

,00 / ,00 / t y ds\ 2 dt 

For all s > 0, 

f°° . ft s\ a dt f°° . / l\ a du „, , 
/ mm -, - — = / mm \ u, — — = C(a) < 00, 
Jo \s'tj t J \'uj u K 1 

and the same is true if we reverse the roles of s and t. Therefore, if we apply Schwartz' 
inequality, Fubini's theorem and Lemma 3.6 we get that 

00 ' ro ° 't s\ a .„ dsVdt 



vo 



min(-,- \\Qsf\\ L > H - j 



< C(a) [ 
Jo 



. ft s\ a ds\ f r°° .ft s\ a .. n 2 d S \ dt 

00 ' ^ / t s\ Q dt\ „„ „„ 2 



< C(a,ip,n, [w]A 2 )\\f\\ 2 L 2 {w) . 

□ 

Operator norm bounds such as those in (3.6) can generally be deduced in the 
unweighted case using the Fourier transform or kernel estimates. We will make use 
of the following result from Grafakos [18, Theorem 8.6.3]. 

Lemma 3.9. Let {T t }, t > be a family of integral operators such that T t l = and 
such that the kernels K t satisfy 

(3.8) \K t (x,y)\< 



+1 



t n (l +t~ 1 \x - y\) n 
(3-9) \K t (x,y)-K t (x,y')\<^^ 
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Then for some a > 0, 

II^I|b(l 2 ) < Cmin 

In order to get this estimate on L 2 (w), w G A2, we use the following clever appli- 
cation of interpolation due to Duoandikoetxea and Rubio de Francia [14]. 

Lemma 3.10. Suppose that a sublinear operator T is bounded on L 2 (w) for all w G 
A 2 , with 1 1 T 1 1 b(l 2 (u>)) depending only on [w]a 2 an d dimension n. Then for any w G A 2 , 
there exists 9, < 9 < 1, that depends on [w]a 2 such that, 

\\T\\b{l*(w)) < C(n, [w} A2 )\\T\\ e B{L 2y 

Proof. This is a consequence of the structural properties of A 2 weights and the theory 
of interpolation with change of measure due to Stein and Weiss [28] (see also Bergh 
and Lofstrom [6]). Given w G A 2 , there exists s > 1, depending only on [w]a 2 , such 
that w s G A 2 with [w s ]a 2 depending only on [w]a 2 - Hence, by hypothesis HTHg^^s)) 
is bounded by a constant that depends only on [w]a 2 and n. Choose 6 such that 
1 — 9 — 1/s. Then w = l 9 w s ^~ e \ so by interpolation with change of measure, 

I \T\ I B(L*(w)) < \\T\\b(L2(ws))\\ T \\b(.L2) < [ w ]A 2 )\\T\\ B(L 2y 

□ 

4. Two Square Function Inequalities 

In this section we prove two weighted square functions inequalities. The first is for 
the operator V t = ^e~* Cw and is used in Sections 6 and 7 below. 

Lemma 4.1. Let p G be a non-negative, radial, decreasing function such that 
\\p\\i = 1 and supp(p) C -£>i(0), and for f G H 1 (w) let 

Gt(x, y) = f{y) - f{x) - (y - x) ■ (p t * V/)(x). 

Then there exists C > depending only on n, w and the constants in the Gaussian 
estimates, such that 

w{x)dx^<C\\Vf\\l 2[w) . 

The second square function inequality is needed in the last step of the proof in 
Section 8. In the unweighted case this inequality is due to Journe [22]; our proof is 
adapted from that of Auscher and Tchamitchian [5] as explicated by Grafakos [18]. 




(4.1) 



V t {x,y)G t {x,y)dy 
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Define the averaging operator A t by 

A t f(x) = -f f(y)dy, 

JQt(x) 

where Qt(x) is the unique dyadic cube containing x such that t < £(Qt(x)) < It. If 
2~ fc ~ 1 < t < 2~ k , and Q k denotes the collection of dyadic cubes of side length 2~ k , 
then the kernel of A t is 

j 

It follows immediately from the definition of A t that |^4j/(x)| < Mf(x), and so by 
Lemma 3.1, A t is bounded on L 2 (w) for all w £ A2 with a constant independent of t. 

Define the operator P t f(x) = p t * f(x), where pt{x) = t~ n p(x/t) and p is a non- 
negative, radial function such that supp(p) C -Bi(O) and \\p\\i = 1. Then again by 
Lemma 3.1, \P t f(x)\ < C(n)Mf(x) and P t is uniformly bounded on L 2 (w), w £ A 2 . 

Lemma 4.2. Given w £ A2, there exists a constant C such that for all f £ L 2 (w), 

(4.2) [°° I \(P t -A t )f(x)\ 2 w(x)dx^<C [ \f(x)\ 2 w(x)dx. 

JO JR n t J«. n 

Remark 4.3. Though Lemma 4.2 is stated in terms of the dyadic grid, it will be clear 
from the proof that it is true if we replace the dyadic grid by the "dyadic" grid relative 
to a fixed cube Q: that is, the collection of cubes gotten as in the construction of the 
standard dyadic grid, but starting with Q instead of [0, l) n . 

Proof of Lemma 4.1. The proof requires several steps. First, we will show that 
there exists a family of sublinear operators {R h }, k > 0, such that (4.1) holds provided 
that there exists A > 1 such that 

f°° f Ht 

(4.3) / / \R k F(x)\ 2 w(x)dx-<CA k \\F\\ 2 L2{w) , 

Jo JR n 1 

where 

n 

F(x) = R-Vf(x)=J2 R i 

i=i 

and the Rj are the Riesz transforms. This square function estimate will follow from 
Proposition 3.7 if we can prove that the operators R k are uniformly bounded on 
L 2 (w) and satisfy two operator norm estimates. Let ip be a radial Schwartz function 
such that ^(0) = and such that (3.5) holds, and let Q s f = ip s * j '. We will first 
show that for all s, t > 0, 

(4-4) \\RtQs\\B(LH w )) < C(n, [w) A2 , if)) 
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and then show the stronger estimate 

(4.5) \\RtQs\\B(LHw)) < C(n, {w] A2 ,P,*P)A k min 




Reduction to (4.3). By Lemma 2.2, 



V t (x,y)G t (x,y)dy 
< at-*- 1 } exp (-C a J^n \G t (x,y)\dy 
= C x r n - X J ^ ^exp (-cj-^^j \G t (x,y)\dy 

k=1 J2 k - 1 t<\x-y\<2 k t V 1 J 



\G t (x,y)\dy. 



In the first integral, make the change of variables h = (y — x)/t; in the integrals 
in the sum, make the change of variables h = (y — x)/2 k t. Then there exist positive 
constants B\ and B 2 such that 



V t (x,y)G t (x,y)dy 
Note that for any B 2 > 



oo » 

< fi 1 r 1 ^2 nfc exp(- J B 2 4 fc ) / |G t (x,x + 2 fc /it)|c//i. 



^2 nfc exp(- J B 2 4 fc ) < oo. 



k=0 



Therefore, by Holder's inequality we get the following estimate: 




V t (x,y)G t (x,y)dy 



w(x) dx 



dt 



[r 1 V2" fc exp(- J B 2 4 fc ) / 

°° poo r { r 

<C , V2 rifc exp(-5 2 4 fe ) / / (r 1 / 



|Gf(x, x + 2 fc /it)| d/i w(x)dx 



\G t (x,x + 2 k ht)\ dh) w(x)dx 



dt 
T 
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if we make the change of variables 1 1— ► 2~ k t = t^, we get 

oo 

< C , ^2 (n+2)fc exp(-5 2 4 fe ) 

fc=0 

(4.6) 

dt 



x / It 1 / \G tk (x, x + ht)\ dh \ w(x)dx- 

Jo JR n V J\h\<l J t 

Assume for the moment that for each k there exists a family of sublinear operators 
{Rt } such that 

(4.7) R k t F{x) = r 1 [ \G tk (x, x + ht)\ dh, 

J\h\<l 

where F = R-Vf, and that there exists A > 1 such that (4.3) holds. By Lemma 3.3, 
||-^||l 2 («;) < C(n, [w]a 2 )\\ V/||l 2 («)), and so if we combine (4.6) and (4.3) we get in- 
equality (4.1). 

To construct the operators R% and show that (4.7) holds, recall that the the Riesz 
potential Ii is the convolution operator with kernel i\(x) = c n ,|x| 1_n , where the 
constant c n is chosen so that 

The following identities are well-known (see, for instance, Stein [29]): if / is a 
Schwartz function and F = R ■ V/, then / = I\F. Further, given any h G M n , 
h-Vf = -R-(hF). 

Define the convolution kernel 

J* (a;) = r 1 + ht) - ; 

then we have that 

r'G tk (x, x + ht) = f( x + ht )~f( x ) _ ptk , {h . v/) {x) 

= {J t h * F) (x) + Ptk * (R ■ (hF))(x) = Ht h F(x). 

Therefore, if we define 

R k t F{x)= [ \H* h F(x)\dh, 

J\h\<l 

then (4.7) holds. And, since the operators H* h are linear, each R% is sublinear. 
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Proof of inequality (4.4). Since by Lemma 3.1 the operators Q s are uniformly 
bounded on L 2 (w), to prove (4.4) it will suffice to prove that for all t and k, 

(4-8) \\R^\\ B{L2{w)) <C(n,[w} A2 ^). 

By definition, for all Schwartz functions g we have that 



\R?g(x)\< / \(J?*g)(x)\dh- 

\h\<l 



\h\<l 



\p tk *(R-(hg))(x)\dh. 



To prove (4.8) we will prove that each term on the right-hand side is uniformly 
bounded on L 2 (w). The boundedness of the second follows immediately from Lem- 
mas 3.1 and 3.3: since ||»||i = 1, for all k and t we have that 



\h\<i 
< C{n 



\p tk * (R ■ (hg))(x)\ dhj w(x)dx 

\pt k * (R ■ (hg))(x)\ 2 w(x) dxdh 
\R ■ (hg)(x)\ 2 w(x) dxdh 



'\h\<l JW 1 

< C(n, [w]a 2 



\h\<i Jm n 

< C{n, \w\a 2 ) / / \hg(x)\ 2 w(x) dxdh 

J\h\<l JR n 



<C(n,[w} A2 ) \g(x)\ 2 w(x)dx. 
We will now prove that the first term is bounded on L 2 (w): 



(4.9) 



h\<l 



\Jt * 9{x)\ dh w(x) dx < C(n, [w) A2 )\\g\\ 2 L2(w) . 



To prove this we first estimate the inner integral on the left-hand side: 

J t +c n (n-l)—— I X{\x/t\>2}) *g{x) 



\Jt*g(x)\dh< 

\h\<l J\h\<l 



dh 



h\<l 



h ' x 



\x\ 



< 



J t h + c n {n - 1) 



h ■ x 



\h\<l 

n 



\x 



n+l 



X{\x/t\>2} 



dh 

dh) * \g\(x) 



\X 



n+l 



X{\ x /t\>2} *g(x) 
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n 

<(L t *\g\)(x) + C(n)J2^9(x), 

8=1 

where 

f h • t 

dh, 



Ux) 



Ji(x) + c n (n - l)-^j^X\x\>2(x) 



\h\<i 

c n is the constant in the Riesz potential and R* is the maximal singular integral 
associated with the Riesz transform FL. 



By Lemma 3.3, 

P n 

I ^ D*J^|2„,./_\ J„xri/„ L..1 Ml„l, 

\L 2 {w)- 



Y,R:g(x)\M*)dx < C(n, MaJIM 12 



i=l 

Therefore, to complete the proof of (4.9) we need to show that 

|2„../„\ j„ ^ /-f/„ L..1 Ml „I|2 



\(L t * \g\)(x)\ w(x) dx < C(n, [w] A2 )\\g\\ L2{w) . 
But by Lemma 3.1 it will suffice to prove that 
(4.10) | L(x) | < C{n) min 



L^ln— 1 ' l^l^+l 



since the right-hand side is a radial, decreasing function in L 1 . To prove this estimate 
we treat several cases depending on the size of x. 

CASE 1: \x\ > 2. In this case, since h ■ Vii(x) = c n (n — l)h ■ x/|x| n+1 , if we apply 
the mean value theorem twice we get that 



\x 



\L(x)\ =c n I \\x + hi 1 - 11 - Id 1 "" + (n - l)^L\dh 
l\ h \<i 



<C(n)[ r^-rdlK^ 

— v 7 /, It n+1 — \rn nH 
J\h\<l \X\ \X\ 



'\h\<l 

Case 2: \x\ < 2. In this case we have that 

oo „ 

(4.11) \L(x)\<J2 ^(x + h) -U(x)\dh. 

Fix k. Then we consider the following sub-cases. 

Sub-case 2.1: |x| > 2~ fc+1 . In this case, \x\/2 > \h\. Hence, \x + h\ > \x\/2, so 

r r\—kn 

(4.12) / \ii{x + h) -ii(x)\dh<C(n) A 



{2- k - 1 <\h\<2~ k } 



\X 



n-1 " 
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Sub-case 2.2: |a;| < 2~ k ~ 2 . In this case, we have that \x + h\ > \h\/2 > 2~ k ~ 2 , so 
we can argue as in the previous case to get (4.12). 

Sub-case 2.3: 2~ k ~ 2 < \x\ < 2~ k+1 . In this case we estimate as follows : 
/ \ii{x + h) - ii(x)\ dh = c n \ \x + h\ l " n - \x\ l ~ n \ dh 

J{2~ k - 1 <\h\<2- k } J{2~ k - 1 <\h\<2- k } 

nA + Ai^-iidh 



X \ TL 1 -I {2- k - 1 <\h\<2- k } 



if we make the change of variables u = h/\x\, we get 

i\ l - n - l\du 





\x 


ll 


\x\ 


Ti- 


-1 



\x\ 

kn 



\X 



n-1 



\X 



n-1 





1 X 


1 ^ i 2~ k i 

<l"l< m } 


\ x \ 


X 






+ u\ 


;|u|<4} \ x \ 




+ 





< I \\- +u \l-n_ 1 \ du 



11 du, 



{i<l"l<4} 



where the last equality holds by rotational symmetry. Note that this last integral is 
finite and its value depends on n but is independent of k and x. 

If we apply these three sub-cases to (4.11), we get that for all |x| < 2, 

\m\<± c YK 

fc=0 11 11 

This completes our proof of (4.10), and so of (4.9). Therefore, we have shown that 
inequality (4.8) holds. 

Proof of Inequality (4.5). By Lemma 3.10 and inequality (4.4), to prove (4.5) it 
will suffice to show the corresponding unweighted norm estimate: 

(4.13) \\R k Q s \\ B(L2) <C(n,[w} A2 ,p^)A k mm(^,^j . 

Fix s, t > 0, k > 0, and recall that Q s f = ip s * f. By Holder's inequality and 
Plancherel's theorem, 

\R k Q s g(x)\ 2 dx 

<C{n) [ [ \4*i/j s *g(x)+p th *{R-{h(i> s *g))(x)\ 2 dxdh 

J\h\<l JR n 
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C(n) 



C{n) 



\h\<l JR" 



h\<l 



d!; dh 



2<itt\t\ 



\m)?m)\ 2 ^dh 



To estimate the last term, we will use the following: since ip is a Schwartz function 
such that ^(0) = 0, 

^CWmind^U^r 1 ). 

Since p G C^°, 

C{p) C{p)2 k 



tk\Z\ ' t\Z\ 



And, since p(0) = 1, 



e 

Therefore, we have that 



uh< -l-2mp(U)Z-h = 0(m 2 ). 
e 2nith < - 1 - 2nip{t k £)£ ■ h 



<C(p)2 k mm(\t^\t^\- 1 ). 



2vrf|£| 

Combining these estimates we get that 
\R k Q s g(x)\ 2 dx 

<C{n,p^)A k [ [ mind^U^r^'mind^U^I- 1 ) 2 !^)! 2 ^^ 

J\h\<l Jl" 

<C(n,p^)A k mm( S -,-) [ [ \g(0\ 2 d£dh 
< C(n,p,tp)A k mm ( -, - J / \g(x)\ 2 dx. 

\t S J J R n 

Inequality (4.13) follows immediately. This finishes the proof of Lemma 4.1. 

Proof of Lemma 4.2. For brevity, let Rt = Pt — A t . Then we have that 

R t f = R t {I-P t )f + R t P t f, 

so it will suffice to prove that (4.2) holds with R t replaced by R t (I — Pt) and RtPt- 
By our remarks above, it is clear that both of these operators are uniformly bounded 



18 DAVID CRUZ-URIBE, SFO AND CRISTIAN RIOS 

on L 2 {w). Therefore, by Lemma 3.10 and Proposition 3.7 it will suffice to show that 
for all s, t > 0, there exist constants C, a > such that 



(4.14) \\R t {I-P t )Q s \\ L 2<Cmm 

(4.15) \\RtPtQ,\\i? < Cmin 



t s 
s't 
t s 
s't 

where Q s f — ip s * f(x) with ip a radial Schwartz function with ^(0) = 0. 
We first prove (4.14). Since R t is uniformly bounded on L 2 , 

\\Rt(I - Pt)Qs\\ L * < C\\(I - P t )Qs\\v>. 

We can bound the right-hand side using Plancherel's theorem. By our choice of ip 
and since p(0) = 1, 

i^oi \i - m)\ < cm. 

Therefore, 

i(7-p t )o./(x)i 2 d a ;= I \{i-m)\ 2 \m) 2 \\lm 2 <^ 

<c(-) 2 [ \Kt)\ 2 dt = c(t) 2 \\f\\i 

W Jr™ \sj 



and so 

\\Rt{i - Pt)Qsh* < C (lj . 

On the other hand, since convolution operators commute and I — Pt is uniformly 
bounded on L 2 , 

\\Rt(I - P t )Q,\\ L ' = \\RtQ s (I - Pt)\\v < C\\R t Q s \\ L 2. 
For any a, < a < 1/2, there exists a constant C such that 

\\A t Q s \\ L 2<c(^y . 

(See Grafakos [18].) Further, since 

mo\<&y \m)\<c\ S £\, 

we can again use Plancherel's theorem to see that 

■s 

.7 



\P t Qs\\v<C^ 
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It follows that for a < 1/2 

\\RtQ s \\L*<c(^)\ 

and so we get that (4.14) holds. 

To prove (4.15) we will apply Lemma 3.9. It will suffice to show that R t P t (l) = 0, 
and that the kernel K t of the operator RtPt satisfies (3.8) and (3.9). The identity is 
immediate: both A t and P t are bounded on L°° and A t (l) = Pt(l) = 1. 

Let K t = J t + L t , where J t is the kernel of P t 2 and L t is the kernel of A t P t , that is, 

Jt( x ,v)= Pt(x - z)p t (y - z)dz, 
Jm. n 

Jr" ■ 3 3 

It is immediate from these expressions that there exists a constant c > such that 
both Jt(x, y) and L t (x, y) are non-zero only if \x — y\ < ct. Further, we have that 

\J t (x,y)\<t- n \\ P \\ L o \\p\\ L i < cr n . 

Similarly, since \Q^\ ~ t~ n , 

\L t (x,y)\<Ct- n \\p\\ L oo. 

Inequality (3.8) follows at once. 

The proof of (3.9) is similar. By the mean value theorem, 

\Pt(y-z)-p t (y'-z)\ <t- n \\Vp\\ L ~ 

If we use this to estimate \ J t (x,y) — J t (x,y ; )\ and \L t (x, y) — L t (x,y')\ and argue as 
before, we get that both satisfy a similar bound. Inequality (3.9) follows at once. 
Therefore, we have proved (4.15) and our proof is complete. 



y-y 



t 



C\y-y' 



t 



n+l 



5. Reduction to a Square Function Estimate 

In this section we begin our proof of Theorem 1.1 by proving that inequality (1.6) 
holds if we have the square function estimate 

(5.1) ( T f \V t f(x)\ 2 w(x)dx^) 12 <C\\Vf\\ L2(w) , 

where V t = — 2tC w e~ t2/Zw . Fix / G D(C W ); recall that D(C W ) is dense in Hl(w) (see 
[11]). If we apply integration by parts to a well known formula (see, for instance, 
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Kato [24]) we get that 



(5-2) C'J 2 f 
Therefore, for all g G L 2 (w), 



3-2t 2 C w r l - 1 



t 



dt 



te- t2c ™£ w f,?£* w e-^g 



2 r * „-t 2 c:„ 



dt 



< 



te-* C -C w f(x)\' 2 w(x) 



dt 

T 



\t 2 C* w e-^g(x)\ 2 w(x)^ 

Hence, by duality we have shown that (1.6) follows from (5.1) provided that we 
can prove the square function inequality 



dt 



Lw 9{x)\ z w{x)-<C\\g\\i Hw) . 

1 b 

Since the semigroups e~ tc * w and e~ tCw satisfy the same estimates, this is equivalent 
to proving that 



(5.3) 



dt 



\tV t g(x)\ 2 w(x)-<C\\g\\l 2{w) . 



To prove this square function estimate we use Proposition 3.7. Let G(x) 
exp(— C2|x| 2 ). Then by Lemma 2.2, 



\tV t f(x)\ < / \tV t (x,y)\\f(y)\dy 



< d / t~ n exp -a 



f - y\ 
t 2 



\f(y)\dy = C 1 (G t *\f\)(x), 



Since G G L 1 and is radial, by Lemma 3.1, sup t>0 (G t * < CMf(x) and the 

operators W t are uniformly bounded on L 2 (w). Let %jj be a radial Schwartz function 
such that "0(0) = 0. For s > 0, let Q s = ip s * f. Again by Lemma 3.1 we have that 
the operators Q s are uniformly bounded on L 2 (w). Therefore, there exists a constant 
C such that for all s, t > 0, 



\tV t Q s \\i3(L 2 (w)) < G. 
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Further, by Lemmas 2.2 and 3.9 there exists (3 > such that 

t 



\\tVtQs\\j3(L2) < Cmin v 
Hence, by Lemma 3.10 we have that for some a > 0, 

\\tV t Q s \\ B ( L 2 {w)) < Cmin 0, . 
Therefore, the operators {tVt} satisfy the hypotheses of Lemma 3.7, so (5.3) holds. 

6. Reduction to a Carleson Measure Estimate 

In this section we prove that (5.1) holds provided that we have a Carleson mea- 
sure estimate. More precisely, we will show that if w G A2, then for all Schwartz 
functions /, 

(6-1) / / l^/(x)rHx)rfx-<L7(l + || 7t || Ci J||V/||i 2(w) , 

JO JR n 1 

where j t = V t (f>, with <f>(x) = x. 

To prove this, we first show the role played by the Carleson measure estimate. 
Let p G be a non-negative, radial, decreasing function such that ||p||i = 1 and 
supp(p) C .Bi(O). By Lemma 2.2, V t l = 0, so 



VJ(x)= I V t (x,y)f(y)dy 

V t (x,y)ydy\ ■ (p t * Vf)(x) 
+ / V t (x,y)\f(y)-f(x)-(y-x)-(pt*Vf)(x)]dy 
(6.2) =lt(x)-{p t *Vf)(x)+ [ V t {x,y)G t (x 7 y)dy, 



where G t (x, y) = f(y) — f(x) — (y — x) ■ (p t * V/)(x). The first term in (6.2) satisfies 
a square function estimate. This follows from the Carleson measure estimate: if 
||7t||c> < 00 ) then by Lemma 3.5, 

r°° r dt 

The proof of (6.1) now follows from Lemma 4.1. 



22 DAVID CRUZ-URIBE, SFO AND CRISTIAN RIOS 

7. The Weighted Tb Theorem for Square Roots 

We have reduced the proof of Theorem 1.1 to proving that / -f t (x) = Vt<f>(x) is a 
Carleson measure with respect to the weight w (x): that is, 

1 f f e{Q) dt 
(7.1) \\lt\\c,w = sup — 77^/ / \lt (x) | 2 — w (x) dx < oo, 

Q w (Q) J Q J t 

where j t (x) = t£ w e~ t2Cw (p (x) and ip (x) = x. 

In order to prove this fact we will establish a Tb theorem for square roots, a 
weighted version of a result due to Auscher and Tchamitchian [5]. For technical 
reasons we actually need a slightly different theorem that is given in Lemma 7.2 
below. However, it seemed clearer to start with this simpler version and then sketch 
the modifications needed to prove the full result. 

Lemma 7.1. Suppose that for every cube Q there exists a mapping Fq : 5Q — > C n 
and a constant C — C (n, A, A, w) < oo such that 



\VF Q {x)\ 2 w{x)dx < Cw (Q), 



5Q 



(iii) / \C W F Q (x)\ 2 w (x) dx < Cw (Q) £ {QY 2 , 
fiv) 



Then 



if 2 dt 

\\lt\\ Cw < c { SU P^7^ / / \lt{x) VP t F Q (x)\ — (x) + sup ||7 t | 
[ Q w{Q) Jq Jo 1 t>o 

where Fq = (Fi, F 2 , . . . , F n ) G C n , and VFq is the matrix (—Fj\ 

\ Xi / i,j=l 

||7t|lct« — C (^, A, A,w;) < oo, i.e., 7 f is a Carleson measure with respect to w. 

Proof. We follow the proof of the unweighted lemma in [19]. Since the kernel of V t 
satisfies the Gaussian bounds (2.3), we have that 

(7.2) sup ||t*IIoo <C(n,X,A,w) < oo, 

t>o 

Indeed, since Vt has zero moment, 

It (x) = V t (p (x) = / V t (x, y)y dy = - V t (x, y) (x - y) dy; 
Jm. n jR n 

thus, applying the Gaussian estimates, 
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(7-3) \ lt Or) I < £ ex P (-^^^) I* " f I ^ 

and this bound is independent of t. Hence, given (iv) to prove (7.1) we only need to 
show that 

1 f f e{Q) dt 

(7.4) sup—— / / \ lt (x)VP t F Q {x)\ 2 — w(x)dx<C(n,X,A,w) < oo. 

Q w [Q) Jq Jo t 

Recall that P t g = p t * g with p t (x) = t _n p(|), p 6 Cq° (Si (0)) is radial and 
Jp(x)dx = l. Let x (z) G <T (M n ) satisfy 

supp(x)c4Q, 0<x<l, x| 3Q = l, |Vx| <Cl{Q)-\ 

Since p is supported in the unit ball and the gradient operator commutes with P t , 
for x G Q and < £ < £ (Q) we have that 

(7.5) VP t F Q (x) = P t VF Q (x) = [ Pt (y) {VxF Q ) (x - y) dy 

JBt(O) 

= P t WxF Q (x) = PtVx (Fq (x) - c ) , 

where c is a constant to be fixed below. Let Fq = Fq — c ; then (7.4) is equivalent 
to 



1 r rt(Q) 



w(Q) 



Q JO 



(7.6) sup — t— — / / j t (x) P t VxFQ (x) — w (x) dx < C (n, A, A, w) < oo 



2 dt 



t 



Recall that \V t = -tC w e~ eLw = -te~ t2Cw C w = te'^ ^ w~ x div AV. Define 9 t 



2 

■ 

identity matrix, ^ t = 9 t V(p = 9tl, so for x e Q and < t < £ (Q) we can write 



2te t2Cw w 1 divA; then 9* acts on n x n matrix valued functions. If 1 is the n x n 



lt (x)VP t xF Q (x) = e t l{x)-P t {VxF Q ){x) 

= [9 t l (x) ■ P t - 9 t ] (VxFq) (x) + d t {VxFq) (x) 

= R t (yxF Q ){x) + e t (yxF Q ){x), 



where Rt = 9 t l (x) Pt — 9 t . We claim that 



I r r*-{Q) / ~ \ 2 dt C f 

( 7 - 7 ) -77T\ / / Rt[VxF Q )(x) -w(x)dx<—— \VF Q \ 2 w(x)dx. 
w{Q) JqJo v j t w (Q) J 5Q 
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If this is the case, then by assumption (ii) 

1 f f m f - \ *dt 

Rt ^xFq (x) —w (x) dx < C < oo. 



w(Q) 




t 



Furthermore, since the operator e f2Cw is a contraction in L 2 (w), by (iii), we have 
1 




4 



O t [VxFqJ {x) —w (x) dx 




< 



w(Q) j 
4 



e * Cw C w \Fq (x) tdtw (x) dx 



\C w Fq (x)| 2 w (x) dxtdt 



W(Q) J0 J5Q 

MQ) 

< C£(Qy 2 / tdt 

Jo 

< C < oo. 

Therefore, to complete the proof we only have to show that (7.7) holds. Since 
fV t (x,y)dy = 0, 



Rt(VxF Q ) (x) 

9 t l (x) ■ P t {VxFq) (x) - 9 t (Vx^q) (x) 
-2te- t2c ™£ w <f) (x) • P t (VxFq) 0) - 2te~ t2c ™£ w xF Q (x) 

V t {x, y) y dy] P t (VxF Q ) (x) + I V t {x, y) {xF Q ) (y) dy 



V t (x,y) [(xFq) (V) ~ (xFq) (x) -(y-x) P t (v%Fq) (x)] dy 



By Lemma 4.1, 



(7- 




V t (x,y)G t (x,y)dy. 



2 dl 

Rt ( Vx^q ) {x) —w (x) dx 




V t (x,y) Gt[F Q ) (x,y) dy 



—w (x) dx < C (n, [w]a 2 ) 2 



VxF q 



L 2 (w) 
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We estimate the right hand side using the product rule and the weighted Poincare 
inequality (see [15]); if we fix c = f 5Q F Q (x) dx, then 

\\VxF Q \\ 2 L2(w) = [ VxF Q (x) 2 w(x)dx 



C 

< 



/ \Fq{x) — Co | 2 w{x) dx + C j |VFq(x)| 2 w{x) dx 

J5Q J5Q 



t 2 (Q) 

C C i^) I \VF Q (x)\ 2 w(x)dx + C I \VF Q (x)\\ 

'5Q J5Q 



t 2 (Q) 

< C [ \VF Q {x)\ 2 w{x)dx. 



'5Q 

This proves (7.7) and our proof is complete. □ 

In our proof we actually need to replace (iv) with a more complicated criterion; 
for ease of reference we record this as a separate lemma. 

Lemma 7.2. Suppose that there exists a finite index set {u} with cardinality N = 
N (n, \,A,w), such that for each cube Q there are mappings Fq v : 5Q — > C and a 
constant C = C (n, A, A, w) < 00 satisfying 



(I) / \VF Q>v (x)\ 2 w(x)dx<C w(Q), 
(II) f \C w F QtU {x)\ 2 w{x)dx<Cw{Q) t{Q)- 2 , 

(III) \\ lt \\l <Csu P \\ lt \\l + Cj2^S / / \lt(x)VP t F Q , l/ (x)\ 2 -w(x)dx. 

Then \\^t\\c w — ^ ( n ' ^> ^ w ) < oc: ^- e -> 7* ^ s a Carleson measure with respect to w. 

The proof is essentially identical to the proof of Lemma 7.1. By (7.3), it is enough 
to show that 

1 f f e{Q) 2 dt 

SU P — 77W / / l7t ( x ) VP t F QiU (x) I — w(x) dx <C (n, A, A, w) < 00, 

q w (Q) JqJo t 

and this is done exactly as in the proof of (7.4). 
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8. Proof of the Weighted Kato Theorem 



To complete the proof of Theorem 1.1 we will construct a finite index set {z/} and 
for each cube Q C W 1 a family of functions Fq >v : 5Q — > C that satisfy the hypotheses 
of Lemma 7.2. To do so we adapt the proof of the non- weighted case in [19]. 

Recall that (f (x) = x. Given a cube Q C W 1 define the function 



(8.1) 



Fn(x) 



-e 2 e(Q) 2 C, 



<p{x) 



where e > will be chosen below. The set {z/} will be a finite collection of vectors in 
C\ \v\ = 1, also to be chosen below. Define 



Fq,u = F q -v. 



Since \Fq jU \ < \Fq\, and similarly for the gradient, to prove (I) and (II) in Lemma 7.2 
it will suffice to prove that Fq satisfies (ii) and (iii) in Lemma 7.1. To prove (iii): 
from (7.3) we have 



tC 



~t 2 C 



ip{x; 



\V*p(x)\ 



V t (x, y) y dy 



< C < oo 



for some constant independent of t. Then 

1 , 1 



\C w Fq (x) 



\V e t(Q)<p(x) 



e£(Q) r^^n £i(Q) 
Since w G A 2 it is a doubling measure, and so 



Vet{Q) (x,y)ydy 



< 



C 



et (Q) 



(8.2) 



/ \C W F Q (x)\ 2 w (x) dx < (Q) £ (Q)- 2 . 



This proves (iii). 

To prove (ii), fix rj e (M n ) such that rj = 1 in 5Q, T] = in E n \ 10Q, Wr}]]^ < 1 
and ||V^|| < C£(Q)~ . Then, by the ellipticity condition (1.1), 



(8.3) / \VF Q {x)\ 2 w{x)dx 

'5Q 



<[ \VF Q {x)\ 2 7 1 {x) 2 w{x)dx< \- 1 [ A(x)\/Fq(x) ■ VFqI 



x)r](x) 2 dx. 



By the conservation property, Lemma 2.1, e * 2£ ™1 = 1; hence, V ' x e t2Cw l = and 
so we can write 



V x e * Cw cp(x) = V x / W t 2 (x,y)ydy 



V x / W t 2 ( x , y){y-x)dy + l = V x G t {x) + 1, 
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2/ 



where G t (x) = (e 1 Cw - l)tp(x). Thus VF Q = 1 + VG £ ^ Q ). By the Gaussian 
decay (1.3) of the heat kernel, 

(8.4) \G t (x)\ < ^ |exp ^-C 2 ^Z_^! j |„ - x | dy < Cf. 

Integrating by parts, we have that 

A(x)VF Q (x) ■ VG ee{Q) (x)ri(x) 2 dx 

A(x)VF Q (x) • V{G em r, 2 ){x) dx 

-2 A(x)VF Q (x) ■ G em {x)Vr]{x)r]{x) dx 

C w F Q (x)G £ e( Q )(x)r](x) 2 dx 

-2 A(x)VF Q (x) ■ G em {x)Vr]{x)r]{x) dx. 

If we combine this with (8.3) we get that 
/ \VFQ(x)\ 2 w(x)r](x) 2 dx 

< A" 1 f A(x)VF Q (x) • (1 + VG em {x)) i]{x) 2 dx 

< A" 1 / \A(x)VF Q (x) ■ l\rj(x) 2 dx 

+ A _1 / \C w F Q (x)\\G £ ^ Q )(x)\r](x) 2 w(x) dx 

+ 2A- 1 f T]{x)\A{x)VF Q (x) ■ Vr]{x)\\G £ e iQ) (x)\dx. 
By ellipticity, the inequality |2a6| < 5a 2 + <5 _1 6 2 , 5 > 0, and (8.4) we get 
|V FQ(x)\ 2 w(x)r](x) 2 dx 

- ~TT / \VFq(x)\ 2 w(x)t)(x) 2 cIx + j r] 2 w(x)dx 
2* Jr™ 2ao J R n 

+ c f!f!M! f \C w F Q (x)\ 2 w(x)dx+^- [ 7](x) 2 w(x)dx 



A ./10Q A •'IK™ 
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+ ^ / \VF Q {x)\ 2 V {x) 2 w(x)dx+^-e 2 £(Q) 2 [ \V V {x)\ 2 w{x) dx. 

* JR n ™ JlOQ 

If we apply (iii), use the fact that s < 1, | Vr/| « ^(Q)^ 1 and w is doubling, and if we 
take 5 > sufficiently small, then 

|VF Q (x)| 2 w(x)r/(x) 2 c/x < ~ / \VF Q (x)\ 2 w(x)r](x) 2 dx + C^-w(Q). 

Re-arranging terms and using the fact that rj = 1 on 5Q, we get (ii). 

To prove that (III) holds we make two reductions. First, recall that the averaging 
operator A t is defined by 

Af(x) = / f(y) dy, 

JQt(x) 

where Qt{x) is the unique dyadic cube containing x such that t < £(Qt{x)) < 2t. 
As before, let x(x) e C£° (W 1 ) be such that supp(x) C 4Q, x\ 3Q = 1 and |Vx| < 

C£(Qy\ Then P t f (x) = P t xf (x), and A t f (x) = A t xf (x) for all x e Q and t, 
< t < i (Q). Since P t commutes with the gradient, by Lemma 4.2 we have that 

r rHQ) fit 
/ / \ lt {x)VP t F Q , v (x)\ 2 -w{x)dx 
Jq Jo 1 

r r e( - QS > dt 

/ / \ lt (x)(P t -A t )xVF Q , u (x)\ 2 T w(x)dx 
Jq Jo 1 

r r e{Q) dt 

/ / \ lt (x)A t VF Q , v {x)\ 2 -w(x)dx 
Jq Jo 1 

r rt(Q) dt 
<sup||7i|lL/ / \{P t -A t )xVF Q ,u{x)\ 2 -w{x)dx 
t>o Jo Jo 1 



'Q 
+ 



r r i{Q) dt 

+ / / \ lt {x)A t VF Q , u {x)\ 2 -w{x)dx 
Jq Jo 



t 



<Csuv\\ lt \\l [ \VF Q>v {x)fw{x)dx 

t>0 J$Q 

f f e{Q) dt 
+ / / | 7t (x)A t VF Q , v (x)\ 2 —w(x)dx. 
Jq Jo * 

Therefore, by (I) we have 

1 f f eiQ) 2 dt 

~Tn\ / / \lt{x)VP t F Qtl/ (x)\ 2 —w{x)dx 

W {^) Jn Jo I 
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< C sup htll^ + ——- / / \^ t (x)AtVF Q>v {x)\ — w(x)dx, 
t>o w [Q) J Q Jo t 

and so to prove (III) it is enough to establish this estimate with P t replaced by A t . 

For our second reduction, we need a lemma that was proved in [19] in the un- 
weighted case; essentially the same argument works in the weighted case. 

Lemma 8.1. Suppose that there exist < rj < 1 such that for all cubes Q, there 
exists a subset Eq C Q with w(Eq) > r\ w{Q), and Q\Eq = [jQj, where {Qj} 
is a set of non- overlapping dyadic sub-cubes of Q. Let Eq = Rq\[JRq. , where Rq 
denotes the Carleson rectangle Q x (0,£(Q)) above Q. If for every cube Q, 

^ (E* Q ) <C lW (Q) , 

then fi is a Carleson measure with respect to w, and 

n(Q*)<—w(Q). 
V 

Therefore, to complete our proof that (III) holds, we need to find our finite index 
set {v} and construct sets Eq as in Lemma 8.1 such that 

( 8 - 5 ) -JTysII \lt(x)\ 2 ^w(x)dx 

l r r e( - QS > dt 

< Csup|| 7t ||^ + C V— - / / \ lt {x)VA t F QtU {x)\ 2 -w{x)dx. 
t>o „w{Q)JqJ t 

We have now come to "the heart of the matter," as was said in [19]. For every v £ C n 
with \v\ — 1 define the cone 

T u = {z £ C n :\z-v{z- v)\ <e\z-u\}. 

Clearly, for each e > there exists a positive integer N = N (e, n) and unit vectors 
V\ , . . . , £ C n such that C n C U J= i Below we will fix our e and will then let 
{u} = {uj}^ =1 be our index set. 

We will first construct the sets Eq. To do so, we will construct sets Eq^ v that 
depend on v\ the desired set Eq will be the union of these sets over our index set 
{z/}. We need two more lemmas. The first is from [19]; since its proof depends only 
on the Gaussian bounds for the kernel of e~ tCw , it holds in the weighted case without 
change. 
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Lemma 8.2. There exists a constant C > depending only on the constants C\, C 2 
in the Gaussian bounds, such that for any cube Q C MJ 1 , 



{ 

Jo. 



(VF Q (x) - 1) dx 



< Ce. 



The second lemma gives two properties of A 2 weights. The first is the well-known 
condition, and the second is closely related. For a proof see [17, 18]. 

Lemma 8.3. If w G A 2 , there exists constants a, 5 > and constants /3, e > such 
that given any cube Q and measurable set E C Q, 



w{E) 

MO) 



<ai w\) and u- p 



w{E) 



To prove (III) we first construct the sets {Eq^} via the stopping time argument 
used in [19]. We include the details to show how the proof adapts to the weighted 
case. Let S\ be the collection of all maximal dyadic cubes Q' C Q such that 



(8.6) 



Re/ v-VF QiV {y)dy<-, 
Jq' 4 



and let B 1 = Bq u = [j Si Q'. Similarly, let <S 2 be the collection of maximal sub-cubes 
of Q such that 



(8.7) 

and let B 2 
Lemma 8.2, 



and thus 



Jq' % £ 
Km = Us 2 Q'- Set B = B Q , U = B 1 {JB 2 and E Q>V = Q\B. By 



[v ■ VF QtU {y) dy 



j{VF Q {y)-l)d y yv 



< Ce, 



(8.8) (1 - Ce) \Q\ < Re / v ■ VF QiV {y) dy 

JQ 

[ vVF QtV (y)dy+ I v ■ V F Q , v (y) dy + [ v ■ V 'F QjU (y) dy) . 



By (8.6) and the definition of B 1 , 



(8.9) Re ! v ■ VF Q>v (v) dy = ReJ2 f / ' ^ F qAv) d V <\H \Q' 
By Holder's inequality, (I) and the definition of A 2 weights, 
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(8.10) Re/ v-VF Q)V {y)dy 

Similarly, by the definition of B 2 and (I), 

(8.11) \B 2 \ < V|Q'|<8eV / \VF Q M\ dy = 8e [ \VF Q>1/ {y)\ dy 

s 2 S2 Jq> J& 

< 8e (J ^ \WF Q M\ 2 w(y)dy^ w~ l (B 2 ) 1/2 

< 8Cew (Q) 1/2 w' 1 (Q) 1/2 < 8Ce [w] M \Q\ = Ce \Q\ 
By Lemma 8.3 we may assume that e is so small that (8.11) implies 



w~ l (B 2 ) ( 1/8 



w-HQ) ~ \l + C[w] Aa . 
Combining this estimate with (8.10) we obtain 

Re / v-VF Q , v {y)dy<\\Q\, 
and putting this together with inequalities (8.8) and (8.9), for e small enough we get 



^\Q\<Re^ vVF Q Mdyj. 



But then, since w G A%, 



^\Q\<ReU u-VF Q , v (y)dy 



<c( w (q) w -i(e q , v )) 1/2 <c\q\( w ~ 1{Eq >^ 



w- 1 (Q) 

Since w~ l is also in A2, by Lemma 8.3 (applied twice) there exists r\ > such that 
(8.12) w (E QjV ) >rjw(Q). 
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Now write Bq >v = Q\Eq v = {JQ u j, where the Q v j are disjoint maximal dyadic 
cubes. Let Eq = Rq \ {JRq„j be the sawtooth region above Eq^ v . If (x, t) e Eg , 
and Q t (x) is the biggest dyadic sub-cube of Q containing x with t < £ (Q t (x)) < 2t, 
then by the maximality of the cubes in Si and S2 we have that Q t (x) ^ £>; hence 

- < Re / v ■ VF QiV {y) dy<-f \VF Q ,„(y)\ dy < 1. 
4 JQt(x) JQ t (x) 

By the definition of A t , this implies that 

- < Rev ■ A t VF QtU (x) < \A t VF QtU (x)\ < —. 

4 oE 

By the definition of T u , if z e T u , then \z\ < (1+e)\z -P\. Hence, for e > sufficiently 
small, we have that 

\z ■ A t F Q;U {x)\ >\{z-u){v A t F QtU {x))\ - \{z - u{z ■ U) ■ A t F Q>l/ {x)\ 

> -\z ■ v\ - e\z ■ v\\AtF Q)V {x)\ > -\z ■ v\ > -\z\. 

4 o Z 

Now fix e small; form our index set {u} = {vj}^ =l as described above, and let 
Eq = y . Eq^. Therefore, if we let z = jt(x), then 74 (x) G T Uj for some 1 < j < N, 
and we have 

1 N 

- A \lt{x)\ 2 <Y,\lt{x) ■ A t WF Q ^{x)\\ 
for all (x,t) e E Q . It follows that 

|7j(x)| 2 — w(x) dx 



w{Q) JJe* q t 

^^yE// \lt(x)-A t VF Q Ax)\ 2 ^w(x)dx 

I f m \lt(x)-A t VF Q , 1/ (x)\ 2 ^w(x)dx, 
W W) - =1 Jq Jo 1 

where, by (8.12), w(Eq) > r]w(Q). This proves (8.5); thus we have shown that (III) 
in Lemma 7.2 holds, and so have completed the proof of Theorem 1.1. 
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